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@ Antlgenically active proteins and peptides, and transmissible gastro-enteritls virus TQEV vaccines. 


@ The invention relates to new antigenic proteins 





which can be used to stimulate the immunity of pigs 
against transmissible gastro-enteritis virus (TGEVV 
vaccines, and to fragments or derivatives thereof. 
The invention also relates to vaccines which com- 
prise such antigenic proteins or peptides. 

The nucleotide sequence and amine acid se- 
quence of a gene coding for protein of an TGEV- 
strain are indicated in figure 1. 
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New antigenlcally active proteins and peptides, and transmissible gastro-enteritis virus (TGEV)- 

vaccines. 



The invention relates to a new antigenic protein 
which is capable of stimulating the immunity of 
pigs against transmissible gastro-enteritis virus 
(TGEV), and to fragments or derivatives thereof. 
The invention further relates to vaccines which 
comprises such an antigenic protein or peptide. 

Transmissible gastro-enteritis is an infection 
disease of the intestine In pigs causing high mortal- 
ity in piglings younger than 2 weeks and hence 
great economic losses to the pig breeders. Inspite 
of the existance of various types of vaccines, the 
problem of TGE has not been solved effectively. 
Therefore, the development of new. better vaccines 
against TGE is urgently necessary. 

The cause of TGE is a virus which belongs to 
the group or corona viruses. Corona viruses are 
positive single-stranded RNA viruses having a lipid 
envelope. The TGEV genome RNA has a length of 
approximately 23,600 bases. The virus particle 
(virion) comprises three Important proteins, namely 
the pepiomeric protein (S), the nucleocapside pro- 
tein (N), and the matrix protein (M). 

The characteristic protrusions at the surface of 
the virion are formed by the pepiomeric protein (S). 
Each peplomer is formed by a giycopolypegtide. 
The protein is fixed in the membrane by means of 
a terminal membrane anchor sequence. It Is known 
from literature that antibodies generated against the 
pepiomeric protein of the said virus, provide pro- 
tection to the pig. 

According to the present invention the 
nucleotide sequence and the amino acid sequence 
of the TGEV Purdue strain were determined. 

The invention therefore relates to proteins and 
fragments thereof which comprise at least one of 
the areas which play a part in the protection 
against TGE infections. 

Theoretically, such fragments can be used as 
such as an antigen for vaccinating against TGE. 
However, it Is to be preferred to couple said frag- 
ments to a suitable carrier. 

The peptide fragments or proteins according to 
the invention which comprise at least one antigenl- 
cally active peptide fragment can be prepared ac- 
cording to methods known for the preparation of 
peptides and proteins. 

Rrst of all, the peptides can be prepared syn- 
thetically by means of known techniques starting 
from the individual amino acids or smaller peptide 
fragments. 

The peptides can also be obtained biosyntheti- 
cally using recombinant DNA techniques and ex- 
pression systems, for example, by: 



a) transfomiation of host cells with an ex- 
pression vector which comprises a DNA, coding for 
an (the) antigenic determinant(s} (peptides in gen- 
eral); 

5 b) expressing the genome inserted in the 

expression vector; 

c) harvesting the cell culture, and 

d) separating the syntheslsed peptide 
(protein). 

70 The invention therefore also relates to a meth- 
od of preparing a DNA molecule which codes for 
peptides according to the invention. Such a method 
comprises the folilowing steps: 

a) isioaton of TGE single-stranded RNA; 

75 b) synthesis of a cONA strand complemen- 

tary to the RNA strand mentioned in a); and 

c) removal of the RNA molecule and synthe- 
sis of a second cDNA strand, using the first cDNA 
strand as a template. 

2o The double-stranded DNA obtained in this 

manner may be incorporated in an expression vec- 
tor in known manner so that a recombinant expres- 
sion vector is formed. This vector may be intro- 
duced into a suitable host cell, for example, by 

25 transformation. 

The invention will now be described in greater 
detail with reference to the ensuing specific exam- 
ple. 

30 

EXAMPLE 

a) Virus and tissue culture. 

35 TGEV strain Purdue was purified twice by 
means of pricking of a plaque on PD5 cells. Culture 
conditions were described previously in L Jacobs 
et al. J. ViroL 57; (1986); 1010-15. 

40 

b) Isolation of subgenomic mRNA's 

PD5 cells (9 x 100 were infected with an infec- 
tion multiplicity of 15. Seven hours after infection 

45 actinomycine D was added and 10 hours after 
infection the cells well lysed and the RNA ex- 
tracted, as described previously by W.J.M. Spaan 
et aL, Virol. 108 (1981); 424-434. The RNA (560 
HQ) was dissolved in a buffer having a high salt 

50 concentration (10 mM Tris-HCI pH 7.5, 0.06% SDS. 
500 mM NaQ) and transferred to an oligo(dT)- 
cellulose column for a selection on poly(A)+ RNA. 
After elution with 10 mM Tris-HCI pH 7.5the RNA 
was concentrated by precipitation with ethanol and 
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fractionated by means of an isokinetic sucrose gra- 
dient (B^M- van der Zeijst at ai., In G-D. Fasman 
(Ed.). Handbook of biochemistry and molecular bi- 
ology, 3rd ed. Physical and Chemical data. Vol. 1_ 
CRC Press Inc., Boca Raton. Ra.. (1976), gg. 426- 
519). Before providing the RNA on the gradient it 
was dissolved in 50 mM Tris-HCI pH 7.4, 10 mM 
LiCI, 1% SDS and heated at 56°C for 1 minute. 
32p.|abelied TGEV mRNA's were added as mark- 
ers. The identification and translation of TGEV 
mRIMA*s is described by Jacobs et al.. 1986. 



c) Cloning of TGEV pepiomer (E2) gene. 

RNA3 obtained after purification over the su- 
crose gradient was used for the synthesis of cDNA 
(carried out as' described by Gubler and Hoffman 
(Gene, 25 (1983). 263-269). Catf-thymus pentamers 
were used as primers and methyl mercury hydrox- 
ide for denaturing \he RNA. The double-stranded 
cDNA was elongated with dC residues and then 
cloned in a pUC9 vector elongated witin dG resi- 
dues. The DMA was used for the transformation of 
E. coji strain JM109 as described by D. Hanahan et 
al.. J. Mol. Biol.. 166. (1983), 557-580. In tills 
manner a cDNA library of approximately 900 trans- 
formants was obtained. The plasmides comprise 
Insertions having a length up to 5kb. 



d> Selection of E2-spectfic recombinants. 

E2-speciftc recombinants could be identified by 
using the homology between TGEV and RPV. Two 
restriction fragments from the E2 gene of FIPV 
were labelled with ^P-^iAJP by means of nick 
translation and were used for the selection of tiie 
recombinants. Twenty-four recombinants hybridis- 
ed witii probe 2. eight with probe 1 and two recom- 
binants hybridised with both probes. A restriction 
map of three recombinants was made. The recom- 
binants which represent the whole pepiomer gene 
were used for sequence analysis. Restriction frag- 
ments were separated in an agarose gel and iso- 
lated by means of an NA45 membrane (Schleicher 
and Schuell) and cloned in Ml 3 vectors (mp8 
and/or mp19). Sequence analysis was carried out 
according to the dideoxy-nucleotide termination 
method of Sanger et al.. Proc. Nath. Acad. Sci. 
USA 74. (1977). 721-732. Both tiie universal Ml 3 
primer and E2-specific primers were used. The 
data were analysed by means of a DEC20y60 com- 
puter and the programs of Staden. Nucl. Acid. Res, 
10, (1982). 4731-4751. 



e) Nucleotide sequence. 

The nucleotide sequence of the pepiomer gene 
of TGEV (Purdue strain) is shown in figure 1. The 

5 sequence comprises an open reading frame of 
4347 nucelotides with a coding capacity for a pre- 
cursor protein of 1449 amino adds (nucleotide po- 
sition 155-4502). Upstream of the ATG codon and 
downstream of first stop codon a recurring se- 

10 quence (ACTAAACT) is present. This "intergenic" 
sequence also flanks the E2 gene of RPV arKi has 
also been found upstream of the nucieocap^d 
gene of TGEV and MHV. 

75 

- f) Amino acid sequence. 

The ATG translation initiation codon (position 
161) is succeeded by 15 amino acids which prob- 

20 ably form the cieavable signal peptide, two hydro- 
philic amino adds succeeded t)y 13 hydrophobic 
amino adds. The extreme carboxy terminal part, 
position 4319-4393 (25 amino acids) comprises a 
hydrophobic region which presumable serves as a 

25 transmembrane anchor, Ixrth regions are under- 
lined in Rgure 1. 



Claims 

30 

1 - A gene which codes for protein of transmis- 
sible gastro-enteritis virus (TGEV), characterized by 
the nucelotide sequence as shown in Rgure 1 . 

2. A protein, characterized by the amino acid 
35 sequence as shown in Rgure 1 , or a part thereof. 

3. An immunogen, characterized in that it com- 
prises a protein or peptide as claimed in Claim 2, 
whether or nor coupled to a carrier. 

4. A vacdne, characterized in that it comprises 
40 an immunogen as claimed in Claim 3. 

5. A method of preparing a protein or peptide 
as claimed in Claim 2. characterized In that 

a) the protein or peptide is synthesised in a 
manner known ger su from individual amino acids 

45 and/or by coupling smaller peptides; or 

b) a host cell is transformed with an expres- 
sion vector which comprises a DNA coding for a 
protein or peptide as claimed in Claim 2, and the 
genome introduced in the expression vector is ex- 

50 pressed. 

6. A DNA molecule which comprises a 
nucleotide sequence which codes for a protein or 
peptide as claimed in Claim 2, or which codes for a 
polypeptide which comprises a protein or peptide 

55 as claimed in Claim 2. 

7. A recombinant DNA expression vector which 
expresses the whole protiain or peptide or a part 
thereof as defined in Claim 2, comprising an 
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operon with initiator sequences and terminator se- 
quence and a nucleotide sequence which codes for 
the said protein or peptide, the said nucleotide 
sequence being situated between the initiator se- 
quence and the terminator sequence of the operon. s 

8. A host ceil which comprises a recombinant 
DNA expression vector and/or DNA molecule as 
claimed in Claim 6 or 7. 

9. A method of preparing a DNA molecule 
which codes for a protein or peptide as darned in io 
Claim 2, characterized in that 

a) single-stranded T6EV*RNA is isolated; 

b) a cDNA strand complementary to said 
RNA strand is synthesised; 

c) the RNA strand is removed and a second 75 
strand of cDNA is synthesised with the first cDNA 

as a template. 
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TTAACACACCATG AAAAAACTATTTGTGGT.TTTGGTCGTftATnr.nATTRATTTATmiA nA 
160 170 ISO 190 200 210 



NFPCSKLTNRTIGNQUNLIE 
CAATTTTCCTTGTTCTAAATT6ACTAATA6AACTATA6GCAACCAGTG6AATCTCATT6A 
220 230 240 250 260 270 



TFLLNrS, SRLPPNSDyVLGO 
AACCTTCCTTCTAAACTATAGTAGTAGGTTACCACCTAATTCAGATGTGGTGTTAGGTGA 
2S0 290 300 310 320 330 



YFPTVQPUFNC IRNNSNDLY 
TTATTTTCCTACTGTACAACCTTGGTTTAATTGCATTCGCAATAATAGTAATGACCTTTA 
340 350 360 370 330 390 



VTLENLKAyVUDrATENl TW 
TGTTACACTGGAAAATCTTAAAGCAGTGTATTGGGATTATGCTACAGAAAATATCACTTG 
400 410 420 430 440 450 



N H R Q R L N y i." U N G Y P Y" S I T ^ T 
GAATCACA6ACAAC6GTTAAACGTAGTCGTTAAT6GATACCCATACTCCATCACA6TTAC 
460 470 4S0 490 500 510 



TTRNFNSAE6AIICICKGSP 
AACAACCCGCAATrTTAATTCTGCTGAAGGTGCTATTATATGCATTTGTAAGGGCTCACC 
520 530 540 550 560 570 



P T T T T E G S L T C N U G b E C R L N 
ACCTACTACCACCACAGAATCTA6TTTGACTTGCAATTGGGGTAGTGAGTGCABGTTAAA 
5S0 590 600 610 620 630 



H K F P I C P S N S E A N C G N n L Y G 
CCflTAAGTTCeCTATATGTCCTTCTAATTCAGAGGCAAATTGTGGTAATATGCTG TATGG 
640 650 660 670 680 690 



LQWFADEyVAYLHGASYRIS 
CCTACAATGGTTT6CAGATGAGGTT6TTGCTTATTTACATGGT6CTA6TTACC6TATTAG 
700 710 720 730 740 750 



F ENQWSGTVTF GDfl RAT TLE 
TTTTGAAAATCAATGGTCTGGCACTGTCACATTTGGTGATATGCGTGCGACAACATTA6A 
760 770 7&0 790 800 810 
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VABTLVDLUWFNPVYDV_SYY 
ASTCGGTS6CAC6CTT6TACACGTTT66T-G6TTTAATCCTGTTTATBATGTCAbTiftTTA 
a20 S30 S^O S50 S,60 370 

R y N N K N G T T V V S N C T D Q C A S 
TAnGGTTAATAATAAAAATGGTACTACCGTAGTTTCCAATTGCACTGATCAATGTGCTAb 
3&0 fi90 900 910 920 930 



YVANyPTTQPGGFIPSDFSF 
TTATGTGGCTAATGTTTTTACTACACAGCCAGGAGGTTTTATACCATCAGATTTTAGTTT 
940 950 960 970 930 990 

NNUFLLTNSSTLVSGKLyTK 
TAATAATTG6TTCCTTCTAACTAATAGCTCCACGTTGGTTAGT6GTAAATTAGTTACCAA 
1000 1010 1020 1030 1040 IOjO 

QPLLVNCLUl-'VPSFEEAAST 
ACA6CCGTTATTAGTTAATTGCTTATG6CCAGTCCCTAGCTTTGAAGAAGCAGCTTCTAC 
1060 1070 lOSO 1090 HOC 1110 

fcfegagfd'qcngavlnntv 
attttgttttgagggtgctggctttgatcaatgtaatggtgctgttttaaataatactgt 

1120 1130 1140 1150 1160 1170 

QVIRFNLNFTTNUQSGKGAT 
A6AC6TCATTAGGTTCAACCTTAATTTTACTACAAATGTACAATCA66TAAG6GTGCCAC 
1130 1190 1200 1210 1220 1230 

y F S L N T T 6 G i<' T L E I S C Y T S 
AGTGTTTTCATTGAACACAACGGGTGGTGTCACTCTTGAAATTTCATii I TATACAGl bAG 
1240 1250 1260 1270 1280 1290 

D S S F F S T G E I P F G M T D G P R Y 
TGACTCGAGCTTTTTCAGTTACGGTGAhATTCCGTTCGGCGTAACTGATGGACCACGGTA 
1300 1310 1320 1330 1340 1350 

C Y y H Y M 6 T A L f< Y L 6 T L P P S V 
CTGTTACGTACACTATAATGGCACAGCTCTTAAGTATTTAGGAACATTACCACCTAGTGT 
1360 1370 1380 1390 1400 1410 
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'^^lAISKUGHFYINGYNFFS 
CAAGGA6ATT6CTATTAGTAAGTGG6GCCATTTTTATATTAATGGTTACAATTTCTTTA6 
1420 1430 1440 1450 1460 -1470 

^^•^PIDC I SFNLTT6DSDVFU 
LACATTTCCTATTGATTGTATATCTTTTAATTTGACCACTGGTGATAGTGACGTTTTCTG 
14fi0 1490 1500 1510 1520 1530 

TIAYTSYTEALVQVENTAIT 
6ACAATAGCTTACACATC6TACACTGAA6CATTAGTACAA6TTGAAAACACA6CTATTAC 
1540 1550 1560 1570 1530 1590 

KVTYCNSHVNNIKCSQITAN 
AAAGGTGACGTATTGTAATAGTCACGTTAATAACATTAAATGCTCTCAAATTACTGCTAA 
1600 1610 1620 1630 1640 1^50 



L N N" G F y P V =; S 5 £ U G I. «/ *v K 5 V/ 
TTiGAftTAATGGATTTTflTCCTGTTTCTTCAAGTGflAGrTGGTCTTGTCAATAAGAGTGT 
1660 1670 1630 i£,90 1700 l/lo 

VLLPSFYTHTIVNITIGLGtt 
TGTGTTACTACCTAGCTTTTACACACATACCATTGTTAACATAACTATTGGTCTTGGTAT 
1720 1730 1740 1750 1760 1770 

KRSGYGQPIASTLSNITLPM 
GAAGC6TA6T66TTAT66TCAACCCATA6CCTCAACATTAAGTAACATCACACTACCAAT 
1780 1790 1800 1810 1820 1830 

QDHNTOVYCIRSDQFSyYt^H 
GCAGGATCACAACACCGATGTGTACTGTATTCGTTCTGACCAATTTTCAGTTTATGTTCA 
1840 1850 1860 1870 1830 1S90 

S T C K 3 A L W D N I F K R N C T D v' L 
TTCTACTTGCAAAAGTGCTTTAT6GGACAATATTTTTAAGCGAAACTGCAC66AC6TTTT 
1900 1910 1920 1930 1940 1950 

0AT AVIKT6TCPFSFDKLNN 
AGATGCCACAGCTGTTATAAAAACTGGTACTTGTCCTTTCTCATTTGATAAATTGAACAA 
1960 1970 19S0 1990 2000 2010 

Jt.Jz^ TFNKFCLSLSPyCANCKF 
TTACTTAACTTTTAACAAGTTCTGTTTGTCGTTGAGTCCTGTTGGTGCTAATTGTAAGTT 
2020 2030 2040 2050 2060 2070 
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D V A _A f< _ if'_ R T E Q V V R 5 L V V I y 

TGATGTAGC l GuCCG l ACAAGAACCAATGAGCAGGTTGTTAGAAGTTTGTATGTAATATA 
2030 2090 2100 2110 2120 2130 



EEGDNIVGi^PSDNSGUHDLS 
TGAAGAA6GAGACAACATAGT6GGT6TACCGTCTGATAATA6TQ6T6TGCACGATTT6TC 
2140 2150 2160 2170 2130 2190 

VLHLDSCTDYN IYGRTGVGI 
AGTGCTACACCTAGATTCCTGCACA6ATTACAATATATATGGTAGAACTGGTGTTGCTAT 
2200 2210 2220 2230 2240 2250 

,IRQTNRTLLSGLYY TSLSGD 
TATTA6ACAAACTAACAGGAC6CTACTTA6TG6CTTATATTACACATCACTATCAGGTGA 
2260 2270 2230 2290 2300 2310 



L LGFKNySDGVI YSVTP CDV 
TTTGTTAGGTTTTAAAAATGTTAGTGATGGTGTCATCTACTCT6TAACGCCATGTGATGT 
2320 2330 2340 2350 2360 2370 



S AQAAVIDGTIVGAITSINS 
AAGC6CACAAGCA6CTGTTATT6ATGQTACCATA6TTGGG6CTATCACTTCCATTAACA6 
2380 2390 2400 2410 2420 2430 



t I. L_ G L T H W T r r p is; p Y y Y § ^ v 
! GAAC FGT J AGGTCI ■^■'LMCfWJOGPL^f^^CtmCnCCTanJTTlTA TTACTArrr fA TAT,:, 
^^40 2450 2^60 2470 2460 2^90 

^^TNDRTRGTAIDSNDFDrP 

TAATTACACAAAT6ATA6BACTC6T66CACT6CAATTGACAGTAATGATTTTGATTGT6A 
2^00 2510 2520 2530 2540 2550 

^^'''YSNIGyCKNGAFVFTW 
ACCTGTCATAACCTATTCTAACATAGGTGTTTGTAAAAATGGTGCTTTTGTTTTTATTAA 
^j60 2570 25S0 2590 2600 2610 

VTHSDGDgQPlSTGNVTIPT 

CGTCACACATTCTGATGGAGACGTGCAACCAATTAGCACTGGTAATGTCACGAtACCTAC 
2620 2630 2640 2650 2660 2670 

P T I S V V E y I Q y y T T P U S T 

AAACTTTACCATATCCGTGCAAGTCGAATATATTCAGGTTTACACTACACCAGTGTCAAT 
^630 2690 2700 2710 2720 2730 
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0 CSRYyCNGNPRCNKLLTQY 
AGACTGTTCAAGATATGTTTGTAATGGTAACCCTAGGTGTAACAAATTGTTAACACAATA 
2740 2750 2760 2770 2730 2790 

VSACQTIEQALAnGARLENfl 
CGTTTCTGCATGTCAAACTATTGAGCAAGCACTTGCAAT6GGTGCCAGACTTGAAAACAT 
2a00 2S10 2S20 2330 2S40 2350 

EVOSMLFySENALKLASyEA 
GGAGGTTGATTCCATGTTGTTTGTTTCTGAAAATGCCCTTAAATTAGCATCTGTTGAAGC 
2360 2&70 2S30 2S90 2900 2910 

FN S SET. LDP lYKEUPNIGGS 
ATTCAATA6TTCAGAAACTTTAGACCCTATTTACAAAGAATGGCCTAATATAGGTGGTTC 
2920 2930 2940 2950 2960 2970 

WLEGLkYILPSHNS KRKYRS 
TTGGCTAGAAGGTCTAAAATACATACTTCCGTCCCATAATAGCAAACGTAAGTATCGTTC 
29S0 2990 3000 3010 3020 3030 

AIEDLLFDKVVTSGLGTVDE 
A6CTATAGA66ACTT6CTTTTTGATAAGGTTGTAACATCTGGTTTAGGTACA6TTGAT6A 
3040 3050 3060 3070 3030 3090 

DYKRCTG6Y0IA DLVCAQYY 
AGATTATAAACGTTGTACAGGTGGTTATGACATAGGTGACTTAGTATGTGCTCAATACTA 
3100 .,3110 3120 3130 3140. 3150 

NGIMVLPGUA NADKMTnYTA 
TAAT66CATCATGGTGCTACCTGGTGTGGCTAATGCTGACAAAATGACTATGTACACA6C 
3160 3170 3130 3190 .3200 3210 

BLAGGITLGALGGGAVAI PF 
ATCCCTTGCAGGTGGTATAACATTAGGTGCACTTGGTGGAGGC6CCGT6GCTATACCTTT 
3220 3S3Q IZ^d 1250 3260 3270 

A VAVQARLNYVALQTDVLNK 
TGCAGTAGCAGTTCAGGCTAGACTTAATTATGTTGCTCTACAAACTGATGTATTGAACAA 
3230 3290 3300 3310 3320 3330 

NQQIL ASAFNQAIGNITQSF 
AAACCAGCAGATTCTGGCTAGT6CTTTCAATCAAGCTATTGGTAACATTACACAGTCATT 
3340 3350 3360 3370 3330 3390 
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CKVNDAIHQRSRGLATVAKA 
TGGTAAGGTTAATQATGCTATACATCAAAGATCACGAGBTCTTGCTACTGTTGCTAAAGC 
3400 3410 3420 3430 3440 3450 



LAKVQDVVNIQ6QALSHLTV 
ATTGGCAAAAGTGCAAGATGTTGTCAACATACAAGGGCAAGCTTTAAGCCACCTAACAGT 
3460 3470 34S0 3490 3500 3510 



QLGJNNFQAISSSISDIYNRL 
ACAATT6CAAAATAATTTCCAAGCCATTAGTAGTTCTATTA6T6ACATTTACAATAGGCT 
3520 3530 3540 3550 3560 3570 



DELSADARVORLITGRLTAL 
TGACGAATTGAGTGCTGATGCACAAGTTGACAGGCTGATCACAGGAAGACTTACAGCACT 
35S0 3590 3600 3610 3620 3630 



N A F y S Q T L T R Q A E R A S R Q L 

TAATGC.ATTTGTGTCTCAGACTCTAACCAGACAAGC6GA6GTTAGG6CTAGTAGACAACT 
3640 3650 3660 3670 3630 3690 



AKDKVNECV.RSQSQRFGFCG 
TGCCAAAGACAAGGTTAATGAATGC6TTAGGTCTCAGTCTCAGAGATTCGGATTCTGTGG 
3700 3710 3720 3730 3/40 3750 



NGTHLFSLANAAPNGPIIFFH 
TAAT6GTACACATTTGTTTTCACTCGCAAAT6CAGCACCAAATGGCATGATTTTCTTTCA 
3760 3770 3780 3790 3S00 3810 



TVLLPTAYETVTAWPGICAS 
CACAGT6CTATTACCAACGGCTTATGAAACTGTGACTGCTTGGCCAGGTATTTGTGCTTC 
3820 3830 3840 3850 3860 3870 



DGDRTFGLVVKDVQLTLFRN 
AGATGGT6ATC6CACTTTTGGACTT6TCGTTAAA6ATGTCCAGTTGACTTTGTTTC6TAA 
3880 3890 3900 3910 3920 3930 



LDDKFYLTPRTPIYQPRVATS 
TCTAGATGACAAGTTCTATTTGACCCCCAGAACTATGTATCAGCCTAGAGTTGCAACTAG 
3940 3950 3960 3970 3980 3990 



PTCTGACTTTGTTCAAATTGAAGGGTGCGATGTGCTGTTTGTTAATGCAACTGTAAGTGA 
4000 4010 4020 4030 4040 4050 
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LPSIIPDYIDINQTVODILE 
TTTGCCTAGTATTATACCTGATTATATTGATATTAATCAGACTGTTCAAGACATATTAGA 
4060 4070 4080 4090 4100 4110 



NFRPNWTVPELTFDIFNATY 
AAATTTTABACCAAATTG6ACTGTACCT6AGTT6ACATTT6ACATTTTTAACGCAACCTA 
4120 4130 4140 4150 4160 4170 



L NLT 6EID0LEFRSEKLHNT 
TTTAAACCTGACTGGTGAAATTGATGACTTAGAATTTAGGTCAGAAAAGCTACATAACAC 
4180 4190 4200 4210 4220 4230 



TVELAILIDNINNTLVNLEW 
CACTGTA6AACTTGCCATTCTCATTGACAACATTAACAATACATTAGTCAATCTTGAATG 
4240 4250 4260 4270 4230 4290 

L NRIET YVKWPUYUWLLIGL 
GCTCAATAGAATTGAAACCTATGTAAAATGGCCTTGGT ATGTGTGGCTACTAATAGGCTT 
4300 4310 4320 4330 4340 4350 

VVIFCIPLLLFCCCSTGCCG 
ABTAGTAAT(!>TTTTGC6TArr:(i.TT ArTr!rT6TTTTrirTnTTrtTART<i.r<i»f!nTTr!rTP!TF;r! 

4360 4370 43&0 4390 4400 4410 



C I6CL6SCCHSICSRRGF EN 
ATGCATAGGTTGTTTAGGAAGTTGTTGTCACTCTATATGTAGTAGAAGACAATTTGAAAA 
4420 4430 4440 4450 4460 4470 

yepiek<^hmh|si *NVNSI ic- 
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